Relevant values: New metadata to provide insight on attribute values at schema level

Authors

  • Sonia Bergamaschi
  • Mirko Orsini
  • Francesco Guerra
  • Claudio Sartori
Abstract

Research on data integration has provided languages and systems able to guarantee an integrated intensional representation of a given set of data sources. A significant limitation common to most proposals is that only intensional knowledge is considered, with little or no consideration for extensional knowledge. In this paper we propose a technique to enrich the intension of an attribute with a new sort of metadata: the “relevant values”, extracted from the attribute values. Relevant values enrich schemata with domain knowledge; moreover they can be exploited by a user in the interactive process of creating/refining a query. The technique, fully implemented in a prototype, is automatic, independent of the attribute domain and it is based on data mining clustering techniques and emerging semantics from data values. It is parametrized with various metrics for similarity measures and is a viable tool for dealing with frequently changing sources, as in the Semantic Web context.
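
As a rough illustration of the general idea in the abstract (grouping an attribute's values by a pluggable similarity metric and naming each group), here is a minimal Python sketch. It is not the authors' prototype: the function names, the character-bigram Jaccard metric, the single-link threshold grouping, the 0.4 threshold, and the rule of naming a cluster after its shortest member are all assumptions chosen for illustration.

```python
from itertools import combinations


def bigrams(value):
    """Lower-cased character bigrams of a value (hypothetical helper)."""
    s = value.lower()
    return {s[i:i + 2] for i in range(len(s) - 1)} or {s}


def jaccard(a, b):
    """One possible similarity metric; any function returning 0..1 can be swapped in."""
    x, y = bigrams(a), bigrams(b)
    return len(x & y) / len(x | y)


def relevant_values(values, similarity=jaccard, threshold=0.4):
    """Group distinct attribute values and return {representative: members}.

    Assumption: two values belong to the same cluster when they are linked,
    directly or transitively, by a similarity at or above the threshold.
    """
    distinct = sorted(set(values))
    parent = {v: v for v in distinct}

    def find(v):
        # Union-find lookup with path halving.
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    for a, b in combinations(distinct, 2):
        if similarity(a, b) >= threshold:
            parent[find(a)] = find(b)

    clusters = {}
    for v in distinct:
        clusters.setdefault(find(v), []).append(v)

    # Assumption: the shortest member (ties broken alphabetically) names the cluster.
    return {min(m, key=lambda s: (len(s), s)): m for m in clusters.values()}


if __name__ == "__main__":
    topics = ["Database Systems", "Database System", "Databases",
              "Data Mining", "Machine Learning", "machine learning"]
    for name, members in relevant_values(topics).items():
        print(name, "<-", members)
```

Passing a different `similarity` function or `threshold` changes the grouping, which loosely mirrors the abstract's point that the technique is parametrized by the similarity metric.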

Similar resources

An ontology based approach to the integration of entity-relationship schemas

In schema integration, schematic discrepancies occur when data in one database correspond to metadata in another. We explicitly declare the context, that is, the meta-information relating to the source, classification, property, etc. of entities, relationships or attribute values in entity-relationship (ER) schemas. We present algorithms to resolve schematic discrepancies by transforming metadata i...

A New Type of Metadata for Querying Data Integration Systems

Research on data integration has provided languages and systems able to guarantee an integrated intensional representation of a given set of data sources. A significant limitation common to most proposals is that only intensional knowledge is considered, with little or no consideration for extensional knowledge. In this paper we propose a technique to enrich the intension of an attribute with a...

Word count 3971

Objective: To design, build and evaluate a storage model able to manage heterogeneous DICOM images. The model must be simple, but flexible enough to accommodate variable content without structural modifications; it must be effective in answering query/retrieval operations according to the DICOM standard, and must provide performance gains in querying/retrieving content to justify its adoption by im...

I-28: Gamete Donation from An Islamic View

One of the distinctions between an ethical proposition and a proposition in Islamic Jurisprudence (Feqh) or Islamic Law lies in their predicates. All three take the free action of human beings as their subject. This article discusses the nature of the human being from an Islamic point of view. The status of a gamete and the nature of the human being are correlated, and both affect the conclusion. H...

A Comprehensive System for Computer-aided Metadata Generation

In this paper, we describe a system that generates suggested values for metadata elements. The system significantly increases the productivity of metadata creators as well as the quality of the metadata. The system is applicable to any metadata standard both for single metadata records and collections of related metadata records. Instead of aiming for automated metadata generation we have devel...

Publication date: 2007